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1 Abstract 

2 Premise of the study: The teosinte branchedl (tbl) gene is a major QTL controlling branching 

3 differences between maize and its wild progenitor, teosinte. The insertion of a transposable 

4 element (Hopscotch) upstream of tbl is known to enhance the gene's expression, causing reduced 

5 tillering in maize. Observations of the maize tbl allele in teosinte and estimates of an insertion 

6 age of the Hopscotch that predates domestication led us to investigate its prevalence and 

7 potential role in teosinte. 

8 Methods: Prevalence of the Hopscotch element was assessed across an Americas-wide sample of 

9 837 maize and teosinte individuals using a co-dominant PCR assay. Population genetic summaries 

10 were calculated for a subset of individuals from four teosinte populations in central Mexico. 

11 Phenotypic data were also collected using seed from a single teosinte population where Hopscotch 

12 was found segregating at high frequency. 

13 Key results: Genotyping results indicate the Hopscotch element is found in a number of teosinte 

14 populations and linkage disequilibrium near tbl does not support recent introgression from maize. 

15 Population genetic signatures are consistent with selection on this locus revealing a potential 

16 ecological role for Hopscotch in teosinte, but a greenhouse experiment does not detect a strong 

17 association between tbl and tillering in teosinte. 

18 Conclusions: Our findings suggest the role of Hopscotch differs between maize and teosinte. 

19 Future work should assess tbl expression levels in teosinte with and without the Hopscotch and 

20 more comprehensively phenotype teosinte to assess the ecological significance of the Hopscotch 

21 insertion and, more broadly, the tbl locus in teosinte. 

22 Key words: domestication; maize; teosinte; teosinte branchedl ; transposable element 
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INTRODUCTION 



10 



Domesticated crops and their wild progenitors provide an excellent system in which to study 



3 adaptation and genomic changes associated with human-mediated selection (Ross- Ibarra et al. 
4 



2007). Plant domestication usually involves a suite of phenotypic changes such as loss of seed 



5 shattering and increased fruit or grain size, which are commonly referred to as the 'domestication 



6 syndrome' (Olsen and Wendel, 2013), and much of the study of domestication has focused on 



7 understanding the genetic variation underlying these traits (Olsen and Gross, 2010). Because 



8 most domesticates show reduced genetic diversity relative to their wild counterparts, effort has 



9 been made to identify agronomically useful variation in crop wild relatives (Flint-Garcia et al 



2009). Often, after identification, the alleles conferring these beneficial traits are bred into 



11 domesticates for crop improvement. For example, Oryza rufipogon, the wild progenitor of 

12 domesticated rice, has proven useful for the integration of a number of beneficial QTL controlling 



13 traits such as grain size and yield into domesticated rice (Kovach and McCouch 2008). In 



14 addition to researching the role of wild alleles in domesticates, researchers have also investigated 

15 the role of variation in domesticated taxa in the evolution of feral and weedy populations 



16 (Ellstrand et al. 2010). But even though domesticated alleles are often found segregating in wild 



17 relatives (Gallavotti et al. 2004 Sigmon and Vollbrecht, 2010), we know almost nothing about 



18 the ecological role of this variation in natural populations. In this paper we present an ecological 

19 genetic analysis of the domestication locus tbl, and specifically the domesticated haplotype at tbl, 

20 in natural populations of the wild ancestor of domesticated maize. 

21 Maize (Zea mays ssp. mays) was domesticated from the teosinte Zea mays ssp. parviglumis 



22 ^hereafter, parviglumis) roughly 9,000 B.P. in southwest Mexico (Piperno et al. 2009, Matsuoka 

23 |et al.[ |2002[ ) . Maize and the teosintes are an attractive system in which to study domestication 

24 due to the abundance of genetic tools developed for maize and well-characterized domestication 



25 loci (Hufford et al., 2012a Doebley, 2004 Hufford et al. 2012b). Additionally, large 



26 naturally-occurring populations of both parviglumis and the highland teosinte Zea mays ssp. 



27 mexicana (hereafter, mexicana) can be found throughout Mexico (Wilkes, 1977 Hufford et al. 
28 

29 Many morphological changes are associated with maize domestication, and understanding the 



2013), and genetic diversity of these taxa is estimated to be high (Ross-Ibarra et al. 2009). 



30 genetic basis of these changes has been a focus of maize research for a number of years (Doebley 



31 2004). One of the most dramatic changes is found in plant architecture: domesticated maize is 
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1 characterized by a central stalk with few tillers and lateral branches terminating in a female 

2 inflorescence, while teosinte is highly tillered and bears tassels (male inflorescences) at the end of 

3 its lateral branches. The teosinte branchedl (tbl) gene, a repressor of organ growth, was 



13 



4 identified as a major QTL involved in branching (Doebley et al. 1995) and tillering (Doebley and 
5 



Stec[ 1991) differences between maize and teosinte. A 4.9 kb retrotransposon (Hopscotch) 



6 insertion into the upstream control region of tbl in maize acts to enhance expression of tbl, thus 



7 repressing lateral organ growth (Doebley et al. 1997; Studer et al. 2011). Dating of the 

8 Hopscotch retrotransposon suggests that its insertion predates the domestication of maize, leading 

9 to the hypothesis that it was segregating as standing variation in populations of teosinte and 



10 increased to high frequency in maize due to selection during domestication (Studer et al. 2011). 



11 The effects of the Hopscotch insertion have been studied in maize (Studer et al. 2011), and 



12 ipiarysis of teosinte alleles at tbl has identified functionally distinct allelic classes (Studer and 



Doebley, 2012), but little is known about the role of tbl or the Hopscotch insertion in natural 



14 populations of teosinte. Previous studies have confirmed the presence of the Hopscotch in samples 

15 of parviglumis, mexicana, and landrace maize; however little is known about the frequency with 

16 which the Hopscotch is segregating in natural populations. 

17 In teosinte and other plants that grow at high population density, individuals detect 

18 competition from neighbors via the ratio of red to far-red light. An increase in far-red relative to 

19 red light accompanies shading and triggers the shade avoidance syndrome, a suite of physiological 

20 and morphological changes such as reduced tillering, increased plant height and early flowering 

21 ( |Kebrom and Brutnell 2007). The tbl locus appears to play an important role in the shade 



22 javoidance pathway in Zea mays (Lukens and Doebley, 1999) and other grasses (Kebrom and 

23 Brutnell, 2007) via changes in expression levels in response to shading. Lukens and Doebley 



24 ( 1999 ) introgressed the teosinte tbl allele into a maize inbred background and noted that under 

25 low density conditions plants were highly tillered but that under high density, plants showed 

26 significantly reduced tillers and grew taller. Based on these results we hypothesize that the 

27 Hopscotch (i.e., the domesticated allele) at tbl may play a role in the ecology of teosinte, 

28 especially in high-density populations. In this study we aim to characterize the distribution of the 

29 Hopscotch insertion in parviglumis, mexicana, and landrace maize, and to examine the phenotypic 

30 effects of the insertion in parviglumis. We use a combination of PCR genotyping for the 

31 Hopscotch element in our full panel and sequencing of two small regions upstream of tbl 

1 combined with a larger SNP dataset in a subset of teosinte populations to explore patterns of 
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2 genetic variation at this locus. Finally, we test for an association between the Hopscotch element 

3 and tillering phenotypes in samples from a natural population of parviglumis. 

4 MATERIALS AND METHODS 

5 Sampling and genotyping — We sampled 1,110 individuals from 350 accessions (247 maize 

6 landraces, 17 mexicana populations, and 86 parviglumis populations; ranging from 1-38 

7 individuals per population, with an average of 11 individuals per population for parviglumis and 

8 mexicana and 2 individuals per landrace accession) and assessed the presence or absence of the 

9 Hopscotch insertion (Appendix 1 and Appendix 2, See Supplemental Materials with the online 
10 version of this article). DNA was extracted from leaf tissue using a modified CTAB approach 



11 J |Doyle and Doylef |1990[ |Maloof et al.[|l984| ). We designed primers using PRIMER3 ( |Rozen and 

12 Skaletsky, 2000) implemented in Geneious (Kearse et al. 2012) to amplify the entire Hopscotch 



13 element, as well as an internal primer allowing us to simultaneously check for possible PCR bias 

14 between presence and absence of the Hopscotch insertion due to its large size (~5kb). Two PCRs 

15 were performed for each individual, one with primers flanking the Hopscotch (HopF/HopR) and 

16 one with a flanking primer and an internal primer (HopF/HopIntR). Primer sequences are HopF, 

17 5'-TCGTTGATGCTTTGATGGATGG-3'; HopR, 5'-AACAGTATGATTTCATGGGACCG-3'; and 

18 HopIntR, 5'-CCTCCACCTCTCATGAGATCG3' (Appendix 3 and Appendix 4, See Supplemental 

19 Materials with the online version of this article). Homozygotes show a single band for absence of 

20 the element (~300bp) and two bands for presence of the element (~5kb, amplification of the 

21 entire element, and ~l.lkb, amplification of part of the element), whereas heterozygotes show all 

22 three bands (Appendix 2, See Supplemental Materials with the online version of this article). 

23 Since we developed a PCR protocol for each allele, if only one PCR resolved well, we scored one 

24 allele for that individual rather than infer the diploid genotype. We used Phusion High Fidelity 

25 Enzyme (Thermo Fisher Scientific Inc., Waltham, Massachusetts, USA) and the following 

26 conditions for amplifications: 98°C for 3 min, 30 cycles of 98°C for 15 s, 65°C for 30 s, and 72°C 

27 for 3 min 30 s, with a final extension of 72°C for 10 min. PCR products were visualized on a 1% 

28 agarose gel and scored for presence/absence of the Hopscotch based on band size. 

29 Genotyping analysis — To calculate differentiation between populations (Fst) an d 



30 subspecies (Fct) we used HierFstat (Goudet, 2005). These analyses only included populations 



31 (n = 32) in which eight or more chromosomes were sampled. To test the hypothesis that the 

32 Hopscotch insertion may be adaptive under certain environmental conditions, we looked for 
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1 significant associations between Hopscotch frequency and environmental variables using the 



2 software BayEnv (Coop et al. 2010). BayEnv creates a covariance matrix of relatedness between 

3 populations and then tests a null model that allele frequencies in populations are determined by 

4 the covariance matrix of relatedness alone against the alternative model that allele frequencies are 

5 determined by a combination of the covariance matrix and an environmental variable, producing 

6 a posterior probability (i.e., Bayes Factor; Coop et aL||2010 ). We used teosinte (ssp. parviglumis 



12 



7 and ssp. mexicana) genotyping and covariance data from Pyhajarvi et al. (2013) for BayEnv, 



8 n with the Hopscotch insertion coded as an additional biallelic marker. SNP data from Pyhajarvi 
9 



et al. (2013) were obtained using the MaizeSNP50 BeadChip and Infinium HD Assay (Illumina, 



10 San Diego, CA, USA). Environmental data were obtained from www.worldclim.org and soil data 



11 were downloaded from the Harmonized World Soil Database ( FAQ /HAS A /ISRIC /ISSC AS / JRC 



2012) at www.harvestchoice . org} Environmental data represent average values for the last 



13 several decades (climatic data) or are likely stable over time (soil data) and therefore represent 

14 conditions important for local adaptation of our samples. Information from these data sets was 



15 summarized by principle component analysis following Pyhajarvi et al. (2013). 

16 Sequencing — In addition to genotyping, we chose a subset of parviglumis individuals for 

17 sequencing. We chose twelve individuals from each of four populations from Jalisco state, Mexico 

18 (San Lorenzo, La Mesa, Ejutla A, and Ejutla B). For amplification and sequencing, we selected 

19 two regions approximately 600bp in size from within the 5' UTR of tbl (Region 1) and from 

20 l,235bp upstream of the start of the Hopscotch (66,169bp upstream from the start of the tbl 



21 ORF; Region 2). We designed the following primers using PRIMER3 (Rozen and Skaletsky, 2000): 

22 for the 5' UTR, 5-'GGATAATGTGCACCAGGTGT-3' and 5'-GCGTGCTAGAGACACYTGTTGCT-3'; 

23 for the 66kb upstream region, 5'-TGTCCTCGCCGCAACTC-3' and 

24 5'-TGTACGCCCGCCCCTCATCA-3' (Appendix 1, See Supplemental Materials with the online 

25 version of this article). We used Taq polymerase (New England Biolabs Inc., Ipswich, 

26 Massachusetts, USA) and the following thermal cycler conditions to amplify fragments: 94° C for 

27 3 min, 30 cycles of 92°C for 40 s, annealing for 1 min, 72°C for 40 s, and a final 10 min extension 

28 at 72°C. Annealing temperatures for Region 1 and Region 2 were 59.7°C and 58.8°C, respectively. 

29 To clean excess primer and dNTPs we added two units of Exonucleasel and 2.5 units of Antarctic 

30 Phosphatase to 8.0 ^L of amplification product. This mix was placed on a thermal cycler with 

31 the following program: 37°C for 30 min, 80°C for 15 min, and a final cool-down step to 4°C. 

32 We cloned cleaned fragments into a TOPO-TA vector (Life Technologies, Grand Island, New 
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1 York, USA) using OneShot TOP 10 chemically competent E. coli cells, with an extended ligation 

2 time of 30 min for a complex target fragment. We plated cells on LB agar plates containing 

3 kanamycin, and screened colonies using vector primers M13 Forward and M13 Reverse under the 

4 following conditions: 96°C for 5 min; then 35 cycles at 96°C for 30 s, 53°C for 30 s, 72°C for 2 

5 min; and a final extension at 72° C for 4 min. We visualized amplification products for 

6 incorporation of our insert on a 1% agarose TAE gel. 

7 Amplification products with successful incorporation of our insert were cleaned using 

8 Exonuclease 1 and Antarctic Phosphatase following the procedures detailed above, and sequenced 

9 with vector primers M13 Forward and M13 Reverse using Sanger sequencing at the College of 

10 Agriculture and Environmental Sciences (CAES) sequencing center at UC Davis. We aligned and 



11 trimmed primer sequences from resulting sequences using the software Geneious (Kearse et al. 
12 



2012). Following alignment, we verified singleton SNPs by sequencing an additional one to four 



13 colonies from each clone. If the singleton was not present in these additional sequences it was 

14 considered an amplification or cloning error, and we replaced the base with the base of the 

15 additional sequences. If the singleton appeared in at least one of the additional sequences we 

16 considered it a real variant and kept it for further analyses. 

17 Sequence analysis — For population genetic analyses of sequenced Region 1 and sequenced 

18 Region 2 we used the Libsequence package ( Thornton , |2003 ) to calculate pairwise F$t between 

19 populations and to calculate standard diversity statistics (number of haplotypes, haplotype 

20 diversity, Watterson's estimator 6w, pairwise nucleotide diversity 6 n , and Tajima's D). To 

21 produce a visual representation of differentiation between sequences and examine patterns in 

22 sequence clustering by Hopscotch genotype we used Phylip 



23 (http://evolution.genetics.washington.edu/phylip.html) to create neighbor-joining trees 

24 with bootstrap-supported nodes (100 repetitions). For creation of trees we also included 



25 homologous sequence data from Maize HapMapV2 (Chia et al. 2012) for teosinte inbred lines 

26 (TILs), some of which are known to be homozygous for the Hopscotch insertion (TIL03, TIL17, 

27 TIL09), as well as 59 lines of domesticated maize. 

28 Introgression analysis — In order to assess patterns of linkage disequilibrium (LD) around 



29 jjthe Hopscotch element in the context of chromosomal patterns of LD we used Tassel (Bradbury 

30 et al. 2007) and calculated LD between SNPs across chromosome 1 using previously published 



31 data from twelve plants each of the Ejutla A (EjuA), Ejutla B (EjuB), San Lorenzo (SLO), and 

32 La Mesa (MSA) populations (Pyhajarvi et al. 2013[ ). We chose these populations because we had 
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1 both genotyping data for the Hopscotch as well as chromosome-wide SNP data for chromosome 1. 

2 For each population we filtered the initial set of 5,897 SNPs on chromosome 1 to accept only 

3 SNPs with a minor allele frequency of at least 0.1, resulting in 1,671, 3,023, 3,122, and 2,167 



6 



12 



14 



4 SNPs for SLO, EjuB, EjuA, and MSA, respectively. We then used Tassel ( |Bradbury et al.[ |2007[ ) 

5 to calculate linkage disequilibrium (r 2 ) across chromosome 1 for each population. 



We examined evidence of introgression on chromosome 1 in these same four populations 



7 (EjuA, EjuB, MSA, SLO) using STRUCTURE (| Falush et aL| |2003| ) and phased data from 
8 



Pyhajarvi et al. (2013), combined with the corresponding SNP data from a diverse panel of 282 



9 maize lines (Cook et al. 2012). SNPs were anchored in a modified version of the IBM genetic 



10 map (Gerke et al. 2013). Since STRUCTURE does not account for LD due to physical linkage we 



11 created haplotype blocks using a custom Perl script from Hufford et al. (2013, code available at 



http://dx.doi . org/10 . 6084/m9 . f igshare . 1165577[). In maize, LD decays over an average 



13 ^istance of 5500bp (Chia et al. 2012); because LD decay is even more rapid in teosinte (Chia 



et al. , 2012 ) we used a conservative haplotype block size of 5kb. We ran STRUCTURE at K=2 



15 under the linkage model, performing 3 replicates with an MCMC burn-in of 10,000 steps and 

16 50,000 steps post burn-in. 

17 Phenotyping of parviglumis — To investigate the phenotypic effects of the Hopscotch 

18 insertion in teosinte we conducted a phenotyping trial in which we germinated 250 seeds of 



19 parviglumis collected in Jalisco state, Mexico (population San Lorenzo; Hufford 2010) where the 

20 Hopscotch insertion is segregating at highest frequency (0.44) in our initial genotyping sample set. 

21 In order to maximize the likelihood of finding the Hopscotch in our association population we 

22 selected seeds from sites within the population where genotyped individuals were homozygous or 

23 heterozygous for the insertion. We chose between 10-13 seeds from each of 23 sampling sites. We 

24 treated seeds with Captan fungicide (Southern Agricultural Insecticides Inc., Palmetto, Florida, 

25 USA) and germinated them in petri dishes with filter paper. Following germination, 206 

26 successful germinations were planted into one-gallon pots with potting soil and randomly spaced 

27 one foot apart on greenhouse benches. Plants were watered three times a day with an automatic 

28 drip containing 10-20-10 fertilizer, which was supplemented with hand watering on extremely hot 

29 and dry days. 

30 Starting on day 15, we measured tillering index as the ratio of the sum of tiller lengths to the 



31 height of the plant (Briggs et al. 2007). Following initial measurements, we phenotyped plants for 

32 tillering index every 5 days through day 40, and then on day 50 and day 60. On day 65 we 
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1 measured culm diameter between the third and fourth nodes of each plant. Following 

2 phenotyping we extracted DNA from all plants using a modified SDS extraction protocol. We 

3 genotyped individuals for the Hopscotch insertion following the PCR protocols listed above. 

4 Tillering index data for each genotypic class did not meet the criteria for a repeated measures 

5 ANOVA, so we transformed the data with a Box-Cox transformation (A = 0) in the Car Package 



6 for R (Fox and Weisberg, 2011) to improve the normality and homogeneity of variance among 

7 genotype classes. We analyzed relationships between genotype and tillering index and tiller 

8 number using a repeated measures ANOVA through a general linear model function implemented 

9 in SAS v.9.3 (SAS Institute Inc., Cary, NC, USA). Additionally, in order to compare any 

10 association between Hopscotch genotype and tillering and associations at other presumably 

11 unrelated traits, we performed an ANOVA between culm diameter and genotype using the same 

12 general linear model in SAS. Culm diameter is not believed to be correlated with tillering index 

13 or variation at tbl and is used as our independent trait for phenotyping analyses. SAS code used 



14 for analysis is available at http://dx.doi.org/10.6084/m9.figshare.1166630 



is RESULTS 

16 Genotyping — The genotype at the Hopscotch insertion was confirmed with two PCRs for 

17 837 individuals of the 1,100 screened. Among the 247 maize landrace accessions genotyped, all 

18 but eight were homozygous for the presence of the insertion (Appendix 1 and Appendix 2, See 

19 Supplemental Materials with the online version of this article). Within our parviglumis and 

20 mexicana samples we found the Hopscotch insertion segregating in 37 (n = 86) and four (n = 17) 

21 populations, respectively, and at highest frequency within populations in the states of Jalisco, 

22 Colima, and Michoacan in central- western Mexico (Fig. [I]). Using our Hopscotch genotyping, we 

23 calculated differentiation between populations (Fst) and subspecies (Fct) for populations in 

24 which we sampled eight or more individuals. We found that Fct = 0, and levels of Fst among 

25 populations within each subspecies (0.22) and among all populations (0.23) are similar to 

26 genome- wide estimates from previous studies (Pyhajarvi et al.||2013[ Table [Q. Although we found 



27 large variation in Hopscotch allele frequency among our populations, BayEnv analysis did not 

28 indicate a correlation between the Hopscotch insertion and environmental variables (all Bayes 

29 Factors < 1). 

30 Sequencing — To investigate patterns of sequence diversity and linkage disequilibrium (LD) 

31 in the tbl region, we sequenced two small (<lkb) regions upstream of the tbl ORF in four 



10 



Downloaded from http://biorxiv.org/on September 18, 2014 



1 populations. After alignment and singleton checking we recovered 48 and 40 segregating sites for 

2 the 5' UTR region (Region 1) and the 66kb upstream region (Region 2), respectively. For Region 

3 1, Ejutla A has the highest values of haplotype diversity and O^, while Ejutla B and La Mesa have 

4 comparable values of these summary statistics, and San Lorenzo has much lower values. 

5 Additionally, Tajima's D is strongly negative in the two Ejutla populations and La Mesa, but is 

6 closer to zero in San Lorenzo (Table [2j Appendix 2, See Supplemental Materials with the online 

7 version of this article). For Region 2, haplotype diversity and 9 n , are similar for Ejutla A and 

8 Ejutla B, while La Mesa and San Lorenzo have slightly lower values for these statistics (Table [2]). 

9 Tajima's D is positive in all populations except La Mesa, indicating an excess of low frequency 

10 variants in this population (Table [2]). Pairwise values of Fst within population pairs Ejutla 

11 A/Ejutla B and San Lorenzo/La Mesa are close to zero for both sequenced regions as well as for 

12 the Hopscotch, while they are high for other population pairs (Table Q. Neighbor joining trees of 

13 our sequence data and data from the teosinte inbred lines (TILs; data from Maize HapMapV2, 
14 



Chia et al. 2012) do not reveal any clear clustering pattern with respect to population or 



15 Hopscotch genotype (Appendix 5, See Supplemental Materials with the online version of this 

16 article); individuals within our sample that have the Hopscotch insertion do not group with the 

17 teosinte inbred lines or domesticated maize that have the Hopscotch insertion. 

18 Evidence of introgression — The highest frequency of the Hopscotch insertion in teosinte 

19 was found in parviglumis sympatric with cultivated maize. Our initial hypothesis was that the 

20 high frequency of the Hopscotch element in these populations could be attributed to introgression 

21 from maize into teosinte. To investigate this possibility we examined overall patterns of linkage 

22 disequilibrium across chromosome one and specifically in the tbl region. If the Hopscotch is found 

23 in these populations due to recent introgression we would expect to find large blocks of linked 

24 markers near this element. We find no evidence of elevated linkage disequilibrium between the 

25 Hopscotch and SNPs surrounding the tbl region in our resequenced populations (Figure [2]), and 

26 r 2 in the tbl region does not differ significantly between populations with (average r 2 of 0.085) 

27 and without (average r 2 = 0.082) the Hopscotch insertion. In fact, average r 2 is lower in the tbl 

28 region (r 2 = 0.056) than across the rest of chromosome 1 (r 2 = 0.083; Table [3]). 

29 The lack of clustering of Hopscotch genotypes in our NJ tree as well as the lack of LD around 

30 tbl do not support the hypothesis that the Hopscotch insertion in these populations of parviglumis 

31 is the result of recent introgression. However, to further explore this hypothesis we performed a 

32 STRUCTURE analysis using Illumina MaizeSNP50 data from four of our parviglumis populations 
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1 ||(EjuA, EjuB, MSA, and SLO) ( |Pyhajarvi et al.[ [20l3[ ) and the maize 282 diversity panel ( |Cook 

2 |et al.[|2012[ ). The linkage model implemented in STRUCTURE can be used to identify ancestry of 

3 blocks of linked variants which would arise as the result of recent admixture between populations. 

4 If the Hopscotch insertion is present in populations of parviglumis as a result of recent admixture 

5 with domesticated maize, we would expect the insertion and linked variants in surrounding sites 

6 to be assigned to the "maize" cluster in our STRUCTURE runs, not the "teosinte" cluster. In all 

7 runs, assignment to maize in the tbl region across all four parviglumis populations is low (average 

8 0.017) and much below the chromosome- wide average (0.20; Table U Fig. [~~). 

9 Phenotyping — To assess the contribution of tbl to phenotypic variation in tillering in a 

10 natural population, we grew plants from seed sampled from the San Lorenzo population of 

11 parviglumis, which had a high mean frequency (0.44) of the Hopscotch insertion based on our 

12 initial genotyping. We measured tiller number and tillering index, the ratio of the sum of tiller 

13 lengths to plant height, for 206 plants from within the San Lorenzo population, and geno typed 

14 plants for the Hopscotch insertion. We also measured culm diameter, a phenotype that differs 



15 between maize and teosinte (Briggs et al., 2007) but is not thought to be affected by the 



16 Hopscotch insertion. Phenotypic data are available at 

17 |http : / /dx . doi . org/ 10* . 6084/m9 . f igshare . 776926 , Our plantings produced 82 homozygotes 

18 for the Hopscotch insertion at tbl, 104 heterozygotes, and 20 homozygotes lacking the insertion; 

19 these numbers do not deviate from expectations of Hardy- Weinberg equilibrium. After 

20 performing a repeated measures ANOVA between our transformed tillering index data and 

21 Hopscotch genotype we find no significant correlation between genotype at the Hopscotch 

22 insertion and tillering index (Fig. [4]), tiller number, or culm diameter. Only on day 40 did we 

23 observe a weak but statistically insignificant (r 2 = 0.02, p = 0.0848) correlation between tillering 

24 index and the Hopscotch genotype, although in the opposite direction of that expected, with 

25 homozygotes for the insertion showing a higher tillering index. 

26 DISCUSSION 



27 Adaptation occurs due to selection on standing variation or de novo mutations. Adaptation 

28 from standing variation has been well-described in a number of systems; for example, selection for 



29 lactose tolerance in humans (Plantinga et al. 2012, Tishkoff et al. 2007), variation at the Eda 



30 locus in three-spined stickleback (Kitano et al. 2008; Colosimo et al. 2005), and pupal diapause 



31 in the Apple Maggot fly (Feder et al. 2003). Although the adaptive role of standing variation has 
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1 been described in many systems, its importance in domestication is not as well studied. 



In maize, alleles at domestication loci (RAMOSA1, Sigmon and Vollbrecht 2010 barren 



25 



27 



3 stalk 1, Gallavotti et al. 2004; and grassy tillers 1, Whipple et al. 2011) are thought to have been 

4 selected from standing variation, suggesting that diversity already present in teosinte may have 

5 played an important role in maize domestication. The teosinte branchedl gene is one of the best 

6 characterized domestication loci, and, while previous studies have suggested that differences in 

7 plant architecture between maize and teosinte are a result of selection on standing variation at 



8 this locus (Clark et al. 2006 Studer et al. 2011), much remains to be discovered regarding 

9 natural variation at this locus and its ecological role in teosinte. 
10 



Studer et al. (2011) genotyped 90 accessions of teosinte (inbred and outbred), providing the 



11 first evidence that the Hopscotch insertion is segregating in teosinte. Given that the Hopscotch 

12 insertion has been estimated to predate the domestication of maize, it is not surprising that it can 

13 be found segregating in populations of teosinte. However, by widely sampling across teosinte 

14 populations our study provides greater insight into the distribution and prevalence of the 



15 Hopscotch in teosinte. While our findings are consistent with Studer et al. (2011) in that we 

16 identify the Hopscotch allele segregating in teosinte, we find it at higher frequency than 

17 previously suggested. The Hopscotch allele is more prevalent in parviglumis than in mexicana in 

18 our sample (Appendix 1, See Supplemental Materials with the online version of this article), 

19 suggesting a different history of the allele amongst teosinte subspecies. Moreover, many of our 

20 parviglumis populations with a high frequency of the Hopscotch allele fall in the Jalisco cluster 



21 identified by Fukunaga et al. (2005), and further distinguish this region from the Balsas River 



22 Basin where maize was domesticated (Matsuoka et al. 2002). Potential explanations for the high 

23 frequency of the Hopscotch element in parviglumis from the Jalisco cluster include gene flow from 

24 maize, genetic drift, and natural selection. 



While gene flow from crops into their wild relatives is well-known, (Ellstrand et al. 1999 



Zhang et al. 


2009 


Thurber et al. 


2010 


Baack et al. 


2008 Hubner et al. 


2012 


Wilkes, 


1977; 


van 



Heerwaarden et al. 2011 Barrett, 1983), our results do not suggest introgression from maize at 



28 the tbl locus, and are more consistent with Hufford et al. (2013) who found resistance to 

29 introgression from maize into mexicana around domestication loci. Clustering in our NJ trees 

30 does not reflect the pattern expected if maize alleles at the tbl locus had introgressed into 

31 populations of teosinte. Moreover, there is no signature of elevated LD in the tbl region relative 

32 to the rest of chromosome 1, and Bayesian assignment to a maize cluster in this region is both low 
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1 and below the chromosome- wide average (Fig. [5J Table [4]). Together, these data point to an 

2 explanation other than recent introgression for the high observed frequency of Hopscotch in the 

3 Jalisco cluster of our parviglumis populations. 

4 Although recent introgression seems unlikely, we cannot rule out ancient introgression as an 

5 explanation for the presence of the Hopscotch in these populations. If the Hopscotch allele was 

6 introgressed in the distant past, recombination may have broken up LD, a process that would be 

7 consistent with our data. We find this scenario less plausible, however, as there is no reason why 

8 gene flow should have been high in the past but absent in present-day sympatric populations. In 

9 fact, early generation maize-teosinte hybrids are common in these populations today (MB 

10 Hufford, pers. observation), and genetic data support ongoing gene flow between domesticated 



23 



11 maize and both mexicana and parviglumis in a number of sympatric populations (Hufford et al. 



2013 Ellstrand et al., 2007 van Heerwaarden et al., 2011 Warburton et al., 2011). 



12 

13 Remaining explanations for differential frequencies of the Hopscotch among teosinte 

14 populations include both genetic drift and natural selection. Previous studies using both SSRs 

15 and genome-wide SNP data have found evidence for a population bottleneck in the San Lorenzo 

16 population (Hufford, 2010 Pyhajarvi et al.| 2013), and the lower levels of sequence diversity in 

17 this population in the 5' UTR (Region 1) coupled with more positive values of Tajima's D are 

18 consistent with these earlier findings. Such population bottlenecks can exaggerate the effects of 

19 genetic drift through which the Hopscotch allele may have risen to high frequency entirely by 

20 chance. A bottleneck in San Lorenzo, however, does not explain the high frequency of the 

21 Hopscotch in multiple populations in the Jalisco cluster. Moreover, available information on 



22 diversity and population structure among Jaliscan populations (Hufford 2010; Pyhajarvi et al. 



2013 ) is not suggestive of recent colonization or other demographic events that would predict a 



24 high frequency of the allele across populations. Finally, diversity values in the 5' UTR of tbl are 

25 suggestive of natural selection acting upon the gene in populations of parviglumis. Overall 

26 nucleotide diversity is 76% less than seen in the sequences from the 66kb upstream region, and 

27 Tajima's D is considerably lower and consistently negative across populations (Table [2]). In fact, 

28 values of Tajima's D in the 5' UTR are toward the extreme negative end of the distribution of 



29 this statistic previously calculated across loci sequenced in parviglumis (Wright et al. 2005 
30 



Moeller et al. 2007). Though not definitive, these results are consistent with the action of 



31 selection on the upstream region of tbl, perhaps suggesting an ecological role for the gene in 

32 Jaliscan populations of parviglumis. 
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Significant effects of the Hopscotch insertion on lateral branch length, number of cupules, and 



11 



2 tillering index in domesticated maize have been well documented (Studer et al. 2011). Weber 
3 



et al. (2007) described significant phenotypic associations between markers in and around tbl and 



4 lateral branch length and female ear length in a sample from 74 natural populations of 



5 parviglumis (Weber et al. 2007); however, these data did not include markers from the Hopscotch 

6 region 66kb upstream of tbl. Our study is the first to explicitly examine the phenotypic effects of 

7 the Hopscotch insertion across a wide collection of individuals sampled from natural populations 

8 of teosinte. We have found no significant effect of the Hopscotch insertion on tillering index or 

9 tiller number, a result that is discordant with its clear phenotypic effects in maize. One 



10 interpretation of this result would be that the Hopscotch controls tillering in maize (Studer et al. 



2011), but tillering in teosinte is affected by variation at other loci. Consistent with this 



12 interpretation, tbl is thought to be part of a complex pathway controlling branching, tillering and 



13 other phenotypic traits (Kebrom and Brutnell, 2007 Clark et al. 2006). 

14 A recent study by Studer and Doebley (2012) examined variation across traits in a three-taxa 



15 allelic series at the tbl locus. Studer and Doebley (2012) introgressed nine unique teosinte tbl 

16 segments (one from Zea diploperennis, and four each from mexicana and parviglumis) into an 

17 inbred maize background and investigated their phenotypic effects. Pheno types were shown to 

18 cluster by taxon, indicating tbl may underlie morphological diversification of Zea. Additional 



19 analysis in Studer and Doebley (2012) suggested tillering index was controlled both by tbl and 

20 loci elsewhere in the genome. Clues to the identity of these loci may be found in QTL studies 



21 that have identified loci controlling branching architecture (e.g., Doebley and Stec 1991, 1993). 

22 Many of these loci (grassy tillers, gtl; tassel-replaces-upper-earsl, trul; terminal earl, tel) have 



23 been shown to interact with tbl (Whipple et al. 2011 Li, 2012), and both trul and tel affect the 



1 same phenotypic traits as tbl (Doebley et al. 1995). trul, for example, has been shown to act 

2 either epistatically or downstream of tbl, affecting both branching architecture (decreased apical 

3 dominance) and tassel phenotypes (shortened tassel and shank length and reduced tassel number; 



Li 2012). Variation in these additional loci may have affected tillering in our collections and 



5 contributed to the lack of correlation we see between Hopscotch genotype and tillering. 

6 In conclusion, our findings demonstrate that the Hopscotch allele is widespread in populations 

7 of parviglumis and mexicana and occasionally at high allele frequencies. Analysis of linkage using 

8 SNPs from across chromosome 1 does not suggest that the Hopscotch allele is present in these 

9 populations due to recent introgression, and it seems unlikely that the insertion would have 
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10 drifted to high frequency in multiple populations. We do, however, find preliminary evidence of 

11 selection on the tbl locus in parviglumis. Coupled with our observation of high frequency of the 

12 Hopscotch insertion in a number of populations, this suggests that the locus may play an 

13 ecological role in teosinte. 

14 In contrast to domesticated maize, the Hopscotch insertion does not appear to have a large 

15 effect on tillering in a diverse sample of parviglumis from a natural population and the phenotypic 

16 consequences of variation at tbl thus remain unclear. Future studies should examine expression 

17 levels of tbl in teosinte with and without the Hopscotch insertion and further characterize the 

18 effects of additional loci involved in branching architecture (e.g. gtl, trul, and tel). These data, 

19 in conjunction with more exhaustive phenotyping, should help to further clarify the ecological 

20 significance of the domesticated tbl allele in natural populations of teosinte. 



16 



Downloaded from http://biorxiv.org/on September 18, 2014 



21 LITERATURE CITED 

22 Baack, E., Y. Sapir, M. Chapman, J. Burke, and L. Rieseberg. 2008. Selection on 

23 domestication traits and quantitative trait loci in crop-wild sunflower hybrids. Mol Ecol 17: 

24 666-677. 

25 Barrett, S. 1983. Crop mimicry in weeds. Econ Bot 37: 255-282. 

26 Bradbury, P., Z. Zhang, D. Kroon, T. Casstevens, Y. Ramdoss, and E. Buckler. 

27 2007. Tassel: software for association mapping of complex traits in diverse samples. 

28 Bioinformatics 23: 2633-2635. 

29 Briggs, W., M. McMullen, B. Gaut, and J. Doebley. 2007. Linkage mapping of 

1 domestication loci in a large maize-teosinte backcross resource. Genetics 177: 1915-1928. 

2 Chia, J., C. Song, P. Bradbury, D. Costich, N. de, Leon, J. Doebley, R. Elshire, 

3 B. Gaut, L. Geller, J. Glaubitz, M. Gore, K. Guill, J. Holland, M. Hufford, 

4 J. Lai, M. Li, X. Liu, Y. Lu, R. McCombie, R. Nelson, J. Poland, B. Prasanna, 

5 T. Pyhajarvi, T. Rong, R. Sekhon, Q. Sun, M. Tenaillon, F. Tian, J. Wang, X. Xu, 

6 Z. Zhang, S. Kaeppler, J. Ross-Ibarra, M. McMullen, E. Buckler, G. Zhang, 

7 Y. Xu, and D. Ware. 2012. Maize hapmap2 identifies extant variation from a genome in flux. 

8 Nat Genet 44: 803-U238. 

9 Clark, R., T. Wagler, P. QuiJADA, and J. Doebley. 2006. A distant upstream enhancer at 

10 the maize domestication gene tbl has pleiotropic effects on plant and inflorescent architecture. 

11 Nat Genet 38: 594-597. 

12 COLOSIMO, P., K. HOSEMANN, S. BALABHADRA, G. VlLLARREAL, M. DlCKSON, 

13 J. Grimwood, J. Schmutz, R. Myers, D. Schluter, and D. Kingsley. 2005. 

14 Widespread parallel evolution in sticklebacks by repeated fixation of ectodysplasin alleles. 

15 Science 307: 1928-1933. 

16 Cook, J., M. McMullen, J. Holland, F. Tian, P. Bradbury, J. Ross-Ibarra, 

17 E. Buckler, and S. Flint-Garcia. 2012. Genetic architecture of maize kernel composition 

18 in the nested association mapping and inbred association panels. Plant Physiol 158: 824-834. 

19 Coop, G., D. Witonsky, A. Di, Rienzo, and J. Pritchard. 2010. Using environmental 

20 correlations to identify loci underlying local adaptation. Genetics 185: 1411-1423. 

17 



Downloaded from http://biorxiv.org/on September 18, 2014 

21 Doebley, J. 2004. The genetics of maize evolution. Annu Rev Genet 38: 37-59. 

22 Doebley, J. and A. Stec. 1991. Genetic-analysis of the morphological differences between 

23 maize and teosinte. Genetics 129: 285-295. 

24 Doebley, J. and A. Stec. 1993. Inheritance of the morphological differences between maize 

25 and teosinte: Comparison of results for two F2 populations. Genetics 134: 559-570. 

26 Doebley, J., A. Stec, and C. Gustus. 1995. teosinte branchedl and the origin of maize: 

27 Evidence for epistasis and the evolution of dominance. Genetics 141: 333-346. 

1 Doebley, J., A. Stec, and L. Hubbard. 1997. The evolution of apical dominance in maize. 

2 Nature 386: 485-488. 

3 Doyle, J. and J. Doyle. 1990. A rapid total dna preparation procedure for small quantities of 

4 fresh tissue. Phytochemical Bulletin 19: 11-15. 

5 Ellstrand, N., L. Garner, S. Hegde, R. Guadagnuolo, and L. Blancas. 2007. 

6 Spontaneous hybridization between maize and teosinte. Journal of Heredity 98: 183-187. 

7 Ellstrand, N., H. Prentice, and J. Hancock. 1999. Gene flow and introgression from 

8 domesticated plants into their wild relatives. Annu Rev Ecol Syst 30: 539-563. 

9 Ellstrand, N. C., S. M. Heredia, J. A. Leak-Garcia, J. M. Heraty, J. C. Burger, 

10 L. Yao, S. Nohzadeh-Malakshah, and C. E. Ridley. 2010. Crops gone wild: evolution of 

11 weeds and invasives from domesticated ancestors. Evolutionary Applications 3: 494-504. 

12 Falush, D., M. Stephens, and J. Pritchard. 2003. Inference of population structure using 

13 multilocus genotype data: Linked loci and correlated allele frequencies. Genetics 164: 

14 1567-1587. 

15 FAO/IIASA/ISRIC/ISSCAS/JRC. 2012. Harmonized World Soil Database, version 1.2. FAO, 

16 Rome, Italy and IIASA, Laxenburg, Austria. 

17 Feder, J., S. Berlocher, J. Roethele, H. Dambroski, J. Smith, W. Perry, 

18 V. Gavrilovic, K. Filchak, J. Rull, and M. Aluja. 2003. Allopatric genetic origins for 

19 sympatric host-plant shifts and race formation in rhagoletis. P Natl Acad Sci Usa 100: 

20 10314-10319. 



18 



Downloaded from http://biorxiv.org/on September 18, 2014 



21 Flint-Garcia, S. A., A. L. Bodnar, and M. P. Scott. 2009. Wide variability in kernel 

22 composition, seed characteristics, and zein profiles among diverse maize inbreds, landraces, and 

23 teosinte. Theoretical and applied genetics 119: 1129-1142. 

24 Fox, J. and S. Weisberg. 2011. An R Companion to Applied Regression, vol. Second Edition. 

25 Sage, Thousand Oaks, CA. 

26 Fukunaga, K., T. Nussbaum-Wagler, B. Li, Q. Zhao, Y. Vigouroux, M. Faller, 

27 K. Bomblies, L. Lukens, and J. Doebley. 2005. Genetic diversity and population 

28 structure of teosinte. Genetics 169: 2241-2254. 

1 Gallavotti, A., Q. Zhao, J. Kyozuka, R. Meeley, M. Ritter, J. Doebley, M. Pe, and 

2 R. Schmidt. 2004. The role of barren stalkl in the architecture of maize. Nature 432: 630-635. 

3 Gerke, J., J. Edwards, G. KE, J. Ross-Ibarra, and M. McMullen. 2013. The genomic 

4 impacts of drift and selection for hybrid performance in maize. arXiv 1307.7313. 

5 Goudet, J. 2005. Hierfstat, a package for r to compute and test hierarchical f-statistics. Mol 

6 Ecol Notes 5: 184-186. 

7 Hubner, S., T. Gunther, A. Flavell, E. Fridman, A. Graner, A. Korol, and 

8 K. Schmid. 2012. Islands and streams: clusters and gene flow in wild barley populations from 

9 the levant. Mol Ecol 21: 1115-1129. 

10 Hufford, M. 2010. Genetic and ecological approaches to guide conservation of teosinte (zea 

11 mays ssp. parviglumis) , the wild progenitor of maize. PhD Dissertation : 130pp. 

12 Hufford, M., P. Bilinski, T. Pyhajarvi, and J. Ross-Ibarra. 2012a. Teosinte as a model 

13 system for population and ecological genomics. Trends in Genetics 12: 606-615. 

14 Hufford, M., P. Lubinsky, T. Pyhajarvi, M. Devengenzo, N. Ellstrand, and 

15 J. Ross-Ibarra. 2013. The genomic signature of crop- wild introgression in maize. PLoS 

16 Genetics 9: el003477. 

17 Hufford, M., X. Xu, J. van, Heerwaarden, T. Pyhajarvi, J. Chia, R. Cartwright, 

18 R. Elshire, J. Glaubitz, K. Guill, S. Kaeppler, J. Lai, P. Morrell, L. Shannon, 

19 C. Song, N. Springer, R. Swanson- Wagner, P. Tiffin, J. Wang, G. Zhang, 

20 J. Doebley, M. McMullen, D. Ware, E. Buckler, S. Yang, and J. Ross-Ibarra. 

19 



Downloaded from http://biorxiv.org/on September 18, 2014 



21 2012b. Comparative population genomics of maize domestication and improvement. Nat Genet 

22 44: 808-U118. 

23 Kearse, M., R. Moir, A. Wilson, S. Stones-Havas, M. Cheung, S. Sturrock, 

24 S. Buxton, A. Cooper, S. Markowitz, C. Duran, T. Thierer, B. Ashton, 

25 P. Meintjes, and A. Drummond. 2012. Geneious basic: An integrated and extendable 

26 desktop software platform for the organization and analysis of sequence data. Bioinformatics 

27 28: 1647-1649. 

1 Kebrom, T. and T. Brutnell. 2007. The molecular analysis of the shade avoidance syndrome 

2 in the grasses has begun. Journal of Experimental Botany 58: 3079-3089. 

3 Kitano, J., D. Bolnick, D. Beauchamp, M. Mazur, S. Mori, T. Nakano, and 

4 C. Peichel. 2008. Reverse evolution of armor plates in the threespine stickleback. Curr Biol 

5 18: 769-774. 

6 Kovach, M. and S. McCouch. 2008. Leveraging natural diversity: back through the 

7 bottleneck. Genome studies and Molecular Genetics 11: 193-200. 

8 Li, W. 2012. Tassels replace upper earsl encodes a putative transcription factor that regulates 

9 maize shoot architecture by multiple pathways. PhD Dissertation : 122. 

10 Lukens, L. and J. Doebley. 1999. Epistatic and environmental interactions for quantitative 

11 trait loci involved in maize evolution. Genet Res 74: 291-302. 

12 Maloof, M., K. Soliman, R. Jorgensen, and R. Allard. 1984. Ribosomal dna spacer 

13 length polymorphisms in barley - mendelian inheritance, chromosomal location, and population 

14 dynamics. P Natl Acad Sci Usa 81: 8014-8018. 

15 Matsuoka, Y., Y. Vigouroux, M. Goodman, G. Sanchez, E. Buckler, and 

16 J. Doebley. 2002. A single domestication for maize shown by multilocus microsatellite 

17 genotyping. P Natl Acad Sci Usa 99: 6080-6084. 

18 Moeller, D. A., M. I. Tenaillon, and P. Tiffin. 2007. Population structure and its effects 

19 on patterns of nucleotide polymorphism in teosinte (zea mays ssp. parviglumis). Genetics 176: 

20 1799-1809. 



20 



Downloaded from http://biorxiv.org/on September 18, 2014 



21 Olsen, K. and B. Gross. 2010. Genetic perspectives on crop domestication. Trends in Plant 

22 Science 15: 529-537. 

23 Olsen, K. M. and J. F. Wendel. 2013. A bountiful harvest: Genomic insights into crop 

24 domestication phenotypes. Annual Review of Plant Biology 64: 47-70. 

25 Piperno, D., A. Ranere, I. Holst, J. Iriarte, and R. Dickau. 2009. Starch grain and 

26 phytolith evidence for early ninth millennium bp maize from the central balsas river valley, 

1 mexico. P Natl Acad Sci Usa 106: 5019-5024. 

2 Plantinga, T., S. Alonso, N. Izagirre, M. Hervella, R. Fregel, J. VAN der Meer, 

3 M. Netea, and C. de la Rua. 2012. Low prevalence of lactase persistence in neolithic 

4 south-west europe. Eur J Hum Genet 20: 778-782. 

5 Pyha.ja.rvi, T., M. Hufford, and J. Ross-Ibarra. 2013. Complex patterns of local 

6 adaptation in the wild relatives of maize. Genome Biology and Evolution 5: 1594-1609. 

7 Ross-Ibarra, J., P. Morrell, and B. Gaut. 2007. Plant domestication, a unique 

8 opportunity to identify the genetic basis of adaptation. P Natl Acad Sci Usa 104: 8641-8648. 

9 Ross-Ibarra, J., M. Tenaillon, and B. Gaut. 2009. Historical divergence and gene flow in 

10 the genus zea. Genetics 181: 1399-1413. 

11 Rozen, S. and H. Skaletsky. 2000. Primer3 on the www for general users and for biologist 

12 programmers. Methods in Molecular Biology : 365-386. 

13 SiGMON, B. and E. Vollbrecht. 2010. Evidence of selection at the ramosal locus during 

14 maize domestication. Mol Ecol 19: 1296-1311. 

15 Studer, A. and J. Doebley. 2012. Evidence for a natural allelic series at the maize 

16 domestication locus teosinte branchedl. Genetics 19: 951-958. 

17 Studer, A., Q. Zhao, J. Ross-Ibarra, and J. Doebley. 2011. Identification of a functional 

18 transposon insertion in the maize domestication gene tbl. Nat Genet 43: 1160-U164. 

19 Thornton, K. 2003. libsequence: a C++ class library for evolutionary genetic analysis. 

20 Bioinformatics 19: 2325-2327. 



21 



Downloaded from http://biorxiv.org/on September 18, 2014 



21 Thurber, C, M. Reagon, B. Gross, K. Olsen, Y. Jia, and A. Caicedo. 2010. Molecular 

22 evolution of shattering loci in us weedy rice. Mol Ecol 19: 3271-3284. 

23 Tishkoff, S., F. Reed, A. Ranciaro, B. Voight, C. Babbitt, J. Silverman, K. Powell, 

24 H. MORTENSEN, J. HlRBO, M. OSMAN, M. IBRAHIM, S. OMAR, G. LEMA, T. NYAMBO, 

25 J. Ghori, S. Bumpstead, J. Pritchard, G. Wray, and P. Deloukas. 2007. Convergent 

26 adaptation of human lactase persistence in africa and europe. Nat Genet 39: 31-40. 

27 VAN HEERWAARDEN, J., J. DOEBLEY, W. BRIGGS, J. GLAUBITZ, M. GOODMAN, 

558 J. Gonzalez, and J. Ross-Ibarra. 2011. Genetic signals of origin, spread, and introgression 

559 in a large sample of maize landraces. P Natl Acad Set Usa 108: 1088-1092. 

560 Warburton, M. L., W. Garrison, S. Taba, A. Charcosset, C. Mir, F. Dumas, 

561 D. Madur, S. Dreisigacker, C. Bedoya, B. Prasanna, C. Xie, S. Hearne, and 

562 J. Franco. 2011. Gene flow among different teosinte taxa and into the domesticated maize 

563 gene pool. Genetic Resources and Crop Evolution 58: 1243-1261. 

564 Weber, A., R. Clark, L. Vaughn, J. Sanchez-Gonzalez, J. Yu, B. Yandell, 

565 P. Bradbury, and J. Doebley. 2007. Major regulatory genes in maize contribute to 

566 standing variation in teosinte (zea mays ssp parviglumis). Genetics 177: 2349-2359. 

567 Whipple, C, T. Kebrom, A. Weber, F. Yang, D. Hall, R. Meeley, R. Schmidt, 

568 J. Doebley, T. Brutnell, and D. Jackson. 2011. grassy tillersl promotes apical 

569 dominance in maize and responds to shade signals in the grasses. P Natl Acad Sci Usa 108: 

570 E506-E512. 

571 Wilkes, H. 1977. Hybridization of maize and teosinte, in mexico and guatemala and the 

572 improvement of maize. Economic Botany 31: 254-293. 

573 Wright, S. I., I. V. Bi, S. G. Schroeder, M. Yamasaki, J. F. Doebley, M. D. 

574 McMullen, AND B. S. Gaut. 2005. The effects of artificial selection on the maize genome. 

575 Science 308: 1310-1314. 

576 Zhang, L., Q. Zhu, Z. Wu, J. Ross-Ibarra, B. Gaut, S. Ge, and T. Sang. 2009. Selection 

577 on grain shattering genes and rates of rice domestication. New Phytol 184: 708-720. 



22 



Downloaded from http://biorxiv.org/on September 18, 2014 



Table 1. Pairwise Fst values from sequence and Hopscotch genotyping data 



Comparison 


Region 1 


Region 2 


Hopscotch 


EjuA & EjuB 


0 


0 


0 


EjuA & MSA 


0.326 


0.328 


0.186 


EjuA & SLO 


0.416 


0.258 


0.280 


EjuB & MSA 


0.397 


0.365 


0.188 


EjuB & SLO 


0.512 


0.290 


0.280 


MSA & SLO 


0.007 


0 


0.016 
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Table 2. Population genetic statistics from resequenced regions near the tbl locus 



Population 


# Haplotypes Hap. Diversity 


Q- 


Tajima's D 


Region 1(5' UTR) 


EJUA 


8 0.859 


0.005 


-1.650 


EJUB 


5 0.709 


0.004 


-1.831 


MSA 


6 0.682 


0.004 


-1.755 


SLO 


3 0.318 


0.001 


-0.729 


Region 2 (66kb upstream) 


EJUA 


8 0.894 


0.018 


0.623 


EJUB 


8 0.894 


0.016 


0.295 


MSA 


3 0.682 


0.011 


-0.222 


SLO 


4 0.742 


0.014 


0.932 



Table 3. mean r 2 values between SNPs on chromosome 1, in the broad tbl region, within the 5' 
UTR of tbl (Region 1), and 66kb upstream of tbl (Region 2). 



Population 


Chr. 1 


tbl region 


Region 1 


Region 2 


Ejutla A 


0.095 


0.050 


0.747 


0.215 


Ejutla B 


0.069 


0.051 


0.660 


0.186 


La Mesa 


0.070 


0.053 


0.914 


0.766 


San Lorenzo 


0.101 


0.067 


0.912 


0.636 
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Table 4. Assignments to maize and teosinte in the tbl and chromosome 1 regions from STRUC- 
TURE 



tbl region Chr 1 



Population 


Maize 


Teosinte 


Maize 


Teosinte 


Ejutla A 


0.022 


0.978 


0.203 


0.797 


Ejutla B 


0.019 


0.981 


0.187 


0.813 


La Mesa 


0.012 


0.988 


0.193 


0.807 


San Lorenzo 


0.016 


0.984 


0.205 


0.795 
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Figure 1. Map showing the frequency of the Hopscotch allele in populations of parviglumis where 
we sampled more than 6 individuals. Size of circles reflects number of alleles sampled. The Balsas 
River is shown, as the Balsas River Basin is believed to be the center of domestication of maize. 
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Figure 2. Linkage disequilibrium for SNPs in Mb 261-268 on chromosome 1. The yellow rectangle 
indicates the location of the Hopscotch insertion and the green represents the tbl ORF. A) Ejutla 
A; B) Ejutla B; C) La Mesa; D) San Lorenzo. The upper triangle above the black diagonal is 
colored based on the r 2 value between SNPs while the bottom triangle is colored based on p- value 
for the corresponding r 2 value. 



27 



Downloaded from http://biorxiv.org/on September 18, 2014 



tb1 region 




264 266 268 

Physical Position (Mb) 



Figure 3. STRUCTURE assignment to maize across a section of chromosome 1. The dotted lines 
mark the beginning of the sequenced region 66kb upstream (Region 2) and the end of the tbl ORF. 
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Figure 4. Box-plots showing tillering index in greenhouse grow-outs for phenotyping. White in- 
dicates individuals homozygous for the Hopscotch, light grey represents heterozygotes, and dark 
grey represents homozygotes for the teosinte (No Hopscotch) allele. Within boxes, dark black lines 
represent the median, and the edges of the boxes are the first and third quartiles. Outliers are 
displayed as dots, the maximum value excluding outliers is shown with the top whisker, while the 
minimum excluding outliers is shown with the bottom whisker. 
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